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(57) ABSTRACT 

A system and method are described for providing improved 
cache memory management. Broadly, the system and 
method improve the performance of an asynchronous input/ 
output (1/0) cache by ensuring that a certain predetermined 
amount of space is readily available, at all times, to receive 
new data. In this regard, a memory manager monitors the 
cache memory space, and evaluates how much *^£ree'* or 
available space exists at all times. As new data is read into 
the cache memory space, the amount of "free" space is 
reduced. Once the free spaced is reduced below a predeter- 
mined amount, then one or more cache lines are flushed or 
discarded to ensure that the predetermined amount of space 
remains available at all times. Significantly, the system and 
method eliminate the latency that is associated with check- 
ing a cadie to determine whether free space is available 
and/or freeing up space in a cache for new data. 

15 Claims, 5 Drawing Sheets 
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SYSTEM AND METHOD FOR MANAGING 
DATA IN AN ASYNCHRONOUS I/O CACHE 

MEMORY TO MAINTAIN A 
PREDETERMINED AMOUNT OF STORAGE 
SPACE THAT IS READILY AVAILABLE 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention generally relates to nnemory man- 
agement systems, and more particularly to a system for 
managing data in an asynchronous input/output (I/O) cache 
memory. 

2. Discussion of the Related Art 

In computer system design, a principal objective is to 
continually design faster and more efficient computer sys- 
tems. In this regard, most conventional high-performance 
computer systems include cache memories. As is known, a 
cache memory is a high-speed memory that is positioned 
between a microprocessor and main memory in a compiiter 
system in order to improve system performance. Typically, 
cache memories (or caches) store copies of portions of main 
memory data that are actively being used by the central- 
processing unit (CPU) while a program is running. Since the 
access time of a cache can be faster than that of main 
memory, the overall access time can be reduced. 

Cache memory is used in data storage systems so that 
under typical conditions, a relatively large and usually slow 
main memory can be accessed at a relatively high rate. The 
cache memory is a relatively small high-speed memory that 
can store, at any given time, a portion of the data stored in 
the main memory. The cache memory also includes a cache 
directory or index of the data elements stored therein. The 
cache directory is referenced to provide an indication of 
whether or not each data element is located in the cache 
memory at any given time, and if so, the present location of 
the data element in the cache memory. 

A data storage system is typically responsive to data 
access requests from a host computer. The data access 
requests, for example, include read and write requests. When 
a data storage system having a cache memory receives a 
request for access to data, the cache directory is inspected to 
determine whether the data element to be accessed resides in 
the cache memory. If so, the data storage system accesses the 
data in the cache memory. If not, the data storage system 
accesses the data in the main memory, and if the accessed 
data is likely to be accessed again in the near future, the 
accessed data is copied into the cache memory. 

Although processor caches are perhaps the best known, 
other caches are known and used as well. For example, I/O 
caches are known for buffering and caching data between a 
system bus and an I/O bus. As will be further described 
below, certain system components, like a microprocessor 
and memory, are synchronized off a different clock than I/O 
transactions. When passing data between two differing fre- 
quency domains, it is usually desirable, if not necessary, to 
buffer the data in some way. One way that this is done is by 
passing the data through an I/O cache. 

Because the cache memory has a capacity that is usually 
much smaller than the main memory, it is often necessary for 
data elements in the cache memory to be replaced or 
removed from the cache memory in order to provide space 
in the cache memory for new receiving data elements to be 
copied into the cache memory. In general, for the cache 
memory to be useful, the data elements replaced or removed 
from the cache memory must be less likely to be accessed in 
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the near fiiture than the data elements that are staged into the 
cache memory. This process is conventionally known as 
cache management. 
A wide variety of methods and strategies are known for 

5 managing data within a cache memory to achieve optimum 
system performance. These methods often vary from system 
to system, and from application to application. Management 
methods may vary depending upon whether the cache 
memory is an electronic (e.g., integrated circuit) memory 

jQ that is caching data on a disk drive (main memory), or 
whether the cache memory is a high speed electronic 
memory that is caching data for a slower speed electronic 
memory (main memory). Management methods may further 
vary depending upon the size of the cache memory. 

15 Notwithstanding the various cache memory management 
methods, generally, when space is available within the cache 
memory for new data, the new data is simply copied into the 
available space, and logged in the directory/index of the 
cache. When, however, no additional space is available for 

20 new data then existing data must be discarded to make room 
for the new data. Also, the directory/index must be updated 
accordingly. As previously mentioned, there are a variety of 
methods that are known and implemented to determine 
which data within the cache to discard, each of which seeks 

25 to discard data that will not likely be used again (at least in 
the near term). 

Notwithstanding the various known methods for cache 
memory management, further improvements are desired. 

SUMMARY OF THE INVENTION 

30 

Certain objects, advantages and novel features of the 
invention will be set forth in part in the description that 
follows and in part will become apparent to those skilled in 
the art upon examination of the following or may be learned 

35 with the practice of the invention. The objects and advan- 
tages of the invention may be realized and obtained by 
means of the instrumentalities and combinations particularly 
pointed out in the appended claims. 
To achieve the advantages and novel features, the present 

40 invention is generally directed to a system and method for 
providing improved cache memory management. Broadly, 
the present invention improves the performance of an asyn- 
chronous I/O cache by ensuring that a certain predetermined 
amount of space is readily available, at all times, to receive 

45 new data. In this regard, a memory manager (dedicated 
hardware within the cache) monitors the cache memory 
space, and evaluates how much "free" or available space 
exists at all times. As new data is read into the cache memory 
space, the amount of "&ee" space is reduced. Once the free 

50 spaced is reduced below a predetermined amount, then one 
or more cache lines are discarded to ensure that the prede- 
termined amount of space remains available at all times. 

In accordance with one aspect of the invention, the 
invention includes a method for managing data within the 

55 cache memory composing the steps of identifying a request 
for data from the system memory by a device, wherein the 
data requested is not presently in a data storage area of the 
cache memory, and reading the data from the system 
memory into the data storage area of the cache memory. The 

60 method then evaluates the cache memory to determine 

) whether at least a predetermined amount of space within the 
data storage area is readily available. Then the method 
flushes a portion of the data within the data storage area, 
only if at least the predetermined amount of space is not 

65 readily available. The foregoing steps are then repeated. 
Importantly, the step of reading the data from the system 
memory into the data storage area occurs inunediately after 
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the step of identifying a request for data. In this regard, by FIG. 5 is a block diagram illustrating a cache memory/ 

virtue of the present invention, it may success&Uy be memory manager constructed in accordance with the present 

assumed that at least a predetermined amount of space is invention; and 

immediately available within the cache memory to receive piG. 6 is a flowchart illustrating the lop-level functional 
new data. Therefore, no latency delay is incurred, which is 5 operation of a system constructed in accordance with the 

otherwise incurred if the cache memory is full and certain present invention, 
data must be discarded. 

In accordance with the preferred embodiment, the prede- DETAILED DESCRIPTION OF THE 

termined amount of data may be programmably configured. PREFERRED EMBODIMENT 
Preferably, the acceptable range for this predetermined lo Having summarized various aspects of the present 

amount may vary from one cache line to four cache lines. invention, reference will now be made in detail to the 

Further, the step of reading data into the cache memory may description of the invention as illustrated in the drawings, 

further include the step of determining the number of data while the invention will be described in connection with 

bytes to be read into the cache memory (if more than one), these drawings, there is no intent to limit it to the embodi- 

and immediately reading all the requested data bytes into j^g^ embodiments disclosed therein. On the contrary, the 

available cache memory space. intent is to cover all alternatives, modifications and equiva- 

In accordance with another aspect of the present lents included within the spirit and scope of the invention as 

invention, a method is provided for managing data within a defined by the appended claims, 
cache memory, wherein the method includes the principal 

steps of deteraaining a quantity of available space within a Cache Basics 

data storage area, and comparing the quantity of available Before discussing the novel aspects of the present 

space with a predetermined value. In addition the method invention, additional information regarding cache memories 

include the step of flushing at least one line of data from the will first be presented. As previously mentioned, a cache 

cache memory if the available space is less than the prede- memory is a high-speed memory that is positioned between 

termined value. In accordance with one embodiment the step a microprocessor and main memory in a computer system in 

of comparing includes evaluating a flush control register that order to improve system performance. Cache memories (or 

specifies the predetermined value, caches) store copies of portions of main memory data that 

In accordance with yet another aspect of the present are actively being used.by the central processing unit (CPU) 

invention, a system is provided for managing data in a cache while a program is running. Since the access time of a cache 

memory. The system includes a memory space comprising a can be faster than that of main memory, the overall access 

plurality of data lines within the cache memory, and a time can be reduced. 

memory manager configured to manage the data within the Many micioprocessor-based systems implement a "direct 

memory space. The memory manager further includes first mapped" cache memory. In general, a direct mapped cache 

means for determining a quantity of available space within memory comprises a high-speed data Random Access 

a data storage area, second means for comparing the quantity Memory (RAM) and a parallel high-speed tag RAM. The 

of available space with a predetermined value, and third RAM address of each line in the data cache is the same as 

means for controllably flushing at least one line of data firom the low-order portion of the main memory line address to 

the cache memory if the available space is less than the which the entry corresponds, the high-order portion of the 
predetermined value. ^ main memory address being stored in the tag RAM. Thus, if 

Preferably, the first means includes a circuit configured to main memory is thought of as 2"* blocks of 2" 'lines" of one 

evaluate a directory containing addresses of data stored or more bytes each, the i* line in the cache data RAM will 

within the data storage area. Similarly, the second means be a copy of the i* line of one of the 2" blocks in main 

preferably includes a circuit having a flush control register, memory. The identity of the main memory block that the line 
wherein the flush control register includes a plurality of bits 45 came from is stored in the i''' location in the tag RAM. 

(programmably variable) that specify the predetermined When a CPU requests data from memory, the low-order 

value. Finally, the third means includes a circuit configured portion of the line address is supplied as an address to both 

to flush at least one line of data from the data storage area, the cache data and cache tag RAMs. The tag for the selected 

to maintain at least the predetermined value of cache lines cache entry is compared with the high-order portion of the 
available to receive new data at all times. 50 CPU*s address and, if it matches, then a "cache hit" is 

indicated and the data firom the cache data RAM is enabled 

DESCRIPTION OF THE DRAWINGS onto a data bus of the system. If the tag does not match the 

TTie accompanying drawings incorporated in and forming I'^Sb-o'der portion of the CPU;s address, or the tag data is 

a part of the specification, illustrate several aspects of the f vdid then a cache miss is indicated and the data is 
present invention, and together with the description serve to « itched from mam memory. It is also placed m the cache for 

explain the principles of the invention. In the drawings: P°^^°^lf ^^"'^ , ^^.^ overwriting the previous entry 

L,^ ^ . . , , .„ , ^ , . Typically, an entire fine is read from main memory and 

FIG. 1 ^ a block diagram Illustrating fiindamental com- ^^^^ ^ ^he cache on a cache miss, even if only a byte is 

ponents of a cache memory device; requested. On a data write from the CPU, either the cache 

FIG. 2 is a block diagram of an I/O cache system ^ RAM or main memory or both may be updated, it being 

constructed in accordance with a preferred environment of understood that flags may be necessary to indicate to one 

the present invention; that a write has occurred in the other. 

FIG. 3 is a block diagram of a device including an I/O Accordingly, in a direct mapped cache, each "line" of 

cache, in accordance with a preferred environment of the secondary memory can be mapped to one and only one line 
present invention; 65 in the cache. In a "fiilly associative" cache, a particular line 

FIG. 4 is a schematic diagram of a synchronization circuit of secondary memory may be mapped to any of the lines in 

illustrated in FIG. 3. the cache; in this case, in a cacheable access, all of the tags 
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must be compared to the address in order to determine master asks the cache memory to indicate whether it has a 

whether a cache hit or miss has occurred, "n-way set more current copy of the data. 

associative" cache architectures also exist which represent a FIG. 1 is a block diagram that illustrates certain fiinda- 

compromise between direct mapped caches and fiiUy asso- mental components of a cache memory 50. These funda- 

ciative caches. In an n-way set associative cache 5 mental components include a plurality of tags 52, a corre- 

architecture, each line of secondary memory may be mapped sponding plurality of data lines 54, and a directory/index 56. 

to any of n lines in the cache. In this case, k tags must be As is known, cache tags 52 typically comprise a portion (the 

compared to the address during a cacheable secondary most significant bits) of the address bits that define blocks 

memory access in order to determine whether a cache hit or (lines) of memory location(s) of target data. The least 

miss has occurred. Caches may also be "sector bufiered" or jq significant bits (represented as don't care values) identify the 

"sub-block" type caches, in which several cache data lines, specific position (within the blocks) to a target data value in 

each with its own valid bit, correspond to a single cache tag the corresponding data line 54. In accordance with the 

RAM entry. preferred embodiment of the present invention, the cache 

When the CPU executes instructions that modify the memory 50 utilizes sixty-four byte daU lines 54. Therefore, 

contents of the cache, these modifications must also be made 15 six le^ signifiwnt bits of the address bits specify the 

in the main memory or the data in main memory will PJiUcular location of a t^^^ 

become "stale." There are two conventional techniques for ^. ^ . , . . 
keeping the contents of the main memory consistent with . '""^'"^ ™- \ f ? ^^^l^f S6. As is 
that of the cache-(l) the write-through method and (2) the n'^ 56 mamtams a og or database of the 
. , ^ \. , .1. J 7 .1. u various tags 52. Upon start-up or mitialization, there is no 
write-back or copy-back rnethod. In the wnte-through data stored within the cache memory 50. and Iherefore no 
method, on a cache write hit, da a is written to the main ^ ^ ^^^^ directory 56. Therefore, when a device 
memory unmediately after or while data is written mto the requests data to be read from system memory, an entire Une 
cache. This enables the contents of the mam memory always 54 ^ ^ad firom the system memory into a data line 54 of the 
to be valid and consistent with that of the cache. In the cache memory 50. The most significant bits of the address 
write-back method, on a cache write hit, the system writes 25 bus are written into the corresponding tag 52, which is stored 
data into the cache and sets a "dirty bit" which indicates that in the directory 56. Upon the next request for data from a 
a data word has been written into the cache but not into the specified memory location, the cache memory 50 will exam- 
main memory. A cache controller checks for a dirty bit ine the directory 56 to determine whether that data value is 
before overwriting any line of data in the cache, and if set, presently within the cache. If the data value is, in fact, within 
writes the line of data out to main memory before loading 30 cache memory 50, then it may be accessed and read out 
the cache with new data. at high speed. If, however, the data value is not presently 
A computer system can have more than one level of cache stored within the cache memory 50, then it is retrieved from 
memory for a given address space. For example, in a system memory and read into an available data line 54 
two-level cache system, die "level one" (LI) cache is of the cache memory 50. As before, the most significant bits 
logically adjacent to the host processor. The second level 35 of the address bus are written into a corresponding tag, and 
(L2) cache is logically behind the first level cache, and logged in the directory 56. 

DRAM memory (which in this case can be referred to as The foregoing process is repeated for subsequent requests 

tertiary memory) is located logically behind the second level for data from system memory. At some point, however, all 

cache. When the host processor performs an access to an data lines 54 of the cache memory 50 will be occupied. At 

address in the memory address space, the first level cache 40 this point, the cache memory 50 must discard at least one 

responds if possible. If the first level cache cannot respond line 54 of data, in order to make space available for new data 

(for example, because of an LI cache miss), then the second to be read into the cache 50. As previously mentioned, there 

level cache responds if possible. If the second level cache are a wide variety of algorithms and methods that are known 

also cannot respond, then the access is made to DRAM fordetermining which line or lines 54 of data to discard from 

itself. The host processor does not need to know how many 45 the cache 50. Preferably, an algorithm is chosen that will 

levels of caching are present in the system or indeed that any discard those data lines 54 which are least likely to be used 

caching exists at all. Similarly, the first level cache does not again in the near fumre. As data is discarded from the cache 

need to know whether a second level of caching exists prior 50, the directory 56 is updated accordingly to remove the 

to the DRAM, Thus, to the host processing unit, the com- corresponding tag(s) 52 from the directory space. ^ 

bination of both caches and DRAM is considered merely as 50 An efficient cache yields a high "hit rate" which is the 

a single main memory structure. Similarly, to the LI cache, percentage of cache bits that occur during all memory 

the combination of the L2 cache and DRAM is considered accesses. When a cache has a high hit rate, the majority of 

simply as a single main memory structure. In fact, a third memory accesses are serviced with zero wait states. The net 

level of caching could be included between the L2 cache and effect of a high cache hit rate is that the wait states incuned 

the actual DRAM, and the L2 cache would still consider the 55 on a relatively infrequent miss are averaged over a large 

combination of L3 and DRAM as a single main memory munber of zero wait state cache hit accesses, resulting in an 

structure. average of nearly zero wait states per access. Also, since a 

Computer system cache memories typically cache main cache is usually located on the local bus of the 

memory data for the CPU. If the cache uses a write-back microprocessor, cache hits are serviced locally without 

protocol, then frequently the cache memory will contain 60 requiring use of the system bus. Therefore, a processor 

more current data than the corresponding lines in main operating out of its local cache has a much lower "bus 

memory. This poses a problem for other bus masters (and for utilization." This reduces system bus bandwidth used by the 

other CPUs in a multiprocessor system) desiring to access a processor, making more bandwidth available for other 

line of main memory, because it is not known whether the devices, such as intelligent bus masters, which can indepen- 

main memory version is the most current version of the data. 65 dently gain access to the bus. 

Write-back cache oontroUers, therefore, typically support Although processor caches are perhaps the best known, 

inquire cycles (also known as snoop cycles), in which a bus other caches are known and used as well. For example, I/O 
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caches are known for buffering and caching data between a interface card, or video graphics card, might alter the con- 
system bus and an I/O bus. As will be further described tents of a main memory location that is duplicated in the 
below, certain system components, like a microprocessor cache. When this occurs, the cache is said to bold ^'stale/' 
and memory, are synchronized off a different clock than I/O ''dirty" or invalid data. Also, when the processor executes a 
transactions. When passing data between two differing fre- 5 cache write hit operation to a write-back cache, the cache 
quency domains, it is usually desirable, if not necessary, to receives the new data, but main memory is not updated until 
buffer the data in some way. One way that this is done is by g later time, if at all. In this instance, the cache contains a 
passing the data throu^ an I/O cache. "clean" or correct version of the data and is said to own the 
Cache management is generally performed by a device location, and main memory holds invalid or "stale" data, 
referred to as a cache controller. One cache management ^ Problems would arise if the processor was allowed to access 
duty performed by the cache controller is the handling of stale data from the cache, or if a bus master was allowed to 
processor writes to memory. The manner in which write access dirty data from main memory. Therefore, in order to 
operations are handled determines whether a cache is des- maintain cache coherency, i.e., in order to prevent a device 
ignated as "write-through" or 'Svrite-back." When the pro- such as a processor or bus master from inadvertently receiv- 
cessor initiates a write to main memory, the cache is first ^5 incorrect or stale data, it is necessary for the cache 
checked to determine if a copy of the data from this location controller to monitor the system bus for bus master accesses 
resides in the cache. If a processor write hit occurs in a to main memory when the processor does not control the 
write-back cache design, then the cache location is updated system bus. This method of monitoring the bus is referred to 
with the new data, and main memory is only updated later ^3 snooping. 

if this data is requested by another device, such as a bus in a write-back cache design, the cache controller must 

master. Alternatively, the cache maintains the correct or monitor the system bus during memory reads by a bus 

"clean" copy of data thereafter, and the main memory is only because of the possibility that the cache may own the 

updated when a flush operation occurs. In a wnte-through j ^ ^ jhe cache may contain the only correct copy of 

cache, the mam memory location is generally updated in j^t, location, referred to as modified data. This is 

conjunction with the cache location on a processor wnte hit. ^5 to as read snooping. On a read snoop hit where the 

If a processor wnte miss occurs to a wnte-through cache, the ^^^^ ^^difled data, the cache controller generally 

cache controller may either ignore the wnte mass or may the respective date to main memory, and the 

perforin a wnte-allocate. whereby the cache controller requesting bus master generally reads this daU en route &om 

aUocates a new Ime m the cache m addition to passmg the controller to main memory, this operation being 

data to the main memory In a wnte-back cache design, the 30 ^ ^^^^^ Altemalively, the cache controller 

cache controller generally allocates anew hne m the cache the respective daU ditecUy to the bus master and 

when a processor wnte miss occurs. Tliis generally mvolves ,„ ^ ,„ ,^{5 alternative scheme, the main 

reading the remaming entnes from mam memory to fiU the ^^uld perpetually contain erroneous or "stale" data 

line in addition to allocating the new wnte data. ^^ii , g^^ijg g^^^ occurred. 

Tlic cache controller includes a directory that holds an 35 ^^^^^ write-back and write-through cache designs, the 

associated entry for each set in the cache. In a wnte-through ^j.^.^^ controller must also monitor the system bus during 

cachMhisentry generally hasthree compone^ a tag ^us master writes to memory because the bus master may 

valid bit. and a number of hne valid bits equahng 4e number ^^te to or alter a memory location having data that resides 

of Imcs m each cache set. The Ug acts as a mam memory j„ jhe cache. This is referred to as write snooping. On a write 

page number, and it holds the upper address bits of the 40 snoop hit to a write-through cache, the cache entry is 

particularpage m mammemory&om which thecopy of data ^^^^^ ^^^^^ ^ ^y. ^^^^^^ ^^^^ ^ ^^^^^ 

residmg in the resp«:tiye set of the cache origmated. The controUer. signifying that this entry is no longer correct. In 

status of the tag vahd bit detennines whether the data m the ^ write-back cache, the cache is updated along with main 

respective set of the cache is considered vahd or mvalid. If , ^,t^ ^its are set to indicate that the 

the Ug valid bit is clear then the entire set is considered 45 respective cache location now includes a clean copy of the 

mvahd. If the tag vahd bit is true, then an individual line ^ata. Altematively, a write-back cache may invalidate the 

withm the set is considered vahd or mvahd depending on the g„ti,g , Therefore, in a write-back 

status of Its respective hne valid bit. In a wnte-back cache ^ jg^j ^ache controUer must snoop both bus 

the entnes m the cache directory are generaUy comprised of ^ ^^ ^ I„ , write-through 

a tag and a number of tag slate bits for each of the hnes in 50 ^ ^ eontioUer need only snoop bus 

each set. As before, the tag compnses the upper address bits ^^^^^ memory 

of the particular page in main memory from which the copy . . n . .t u 

• • . J Tu . , . J . • .u ... ff.u J . The process of snooping generally entaib that the cache 

onginated. The tag state bits determme the status of the data _ i.t.u . u jj j u 

c u .• 1- u .u .1. J . • • I'j controller latch the system bus address and perform a cache 

for each respective hne, i.e.. whether the data is mvaM, look-up in the tag directory conesponding to the page ofl&et 

modified (owned) or clean. 55 location where the memory access occurred to see if a copy 

A prmcipal cache managemem policy is the preservation of data from the main memory location being accessed also 
of cache coherency. Cache coherency refers to the require- ■ ^^^^^ i„ ,5,^ ^^he. If a copy of the data from this location 

inenl that any copy of data in a cache must be idenUcal to (or ^ocs reside in the cache, then the cache controller takes the 

actuaUy be) the owner ofthatlocaaonsdata. The owner of appropriate action depending on whether a write-back or 

a location's data IS geoeraUy defined as the respective eo write-through cache design has been implemented, or 

location havmg the most recent or the correct version of ^^ether a read or write snoop hit has occuned. TTiis prevents 

data. The owner of data is generally either an unmodified incompatible or inconsistent data from being stored in main 

location in mam memory, or a modified location m a memory and the cache, thereby preserving cache coherency 
ivriie^DacK cacne. 

In computer systems where independent bus masters can 65 Environment of the Present Invention 

access memory, there is a possibility that a bus master, such Having described certain basic aspects of cache memory 

as a direct memory access controller, netwoik or disk devices,thediscussion will now be directed to the operating 
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environment of the preferred embodiment of present inven- Advantageously, this minimizes the chip space required for 

tion. In this regard, the present invention is directed to a accommodating duplicative data and address storage areas, 

system and method for managing the data within a cache What is duplicated, in connection with the present invention, 

memory. More specifically, the inventive method for man- is certain validity/status circuitry 124 and 126. Specifically, 

aging cache data is particularly useful in an I/O cache, or 5 circuitry 124 is provided to indicate when data within the 

some other similar environment where data is communi- data storage area 120 is valid, from the perq)ective of the 

cated from one synchronous subsystem (synchronized to a system bus 108. Likewise, circuitry 126 is provided to 

first clock) to another synchronous subsystem (synchronized indicate when data within the data storage area 120 is valid 

to a second clock). from the perspective of the I/O bus 110. As will be appre- 

Reference is now made to FIG. 2, which is a block elated from the discussion that follows, the system achieves 

diagram that illustrates a top-level view of a system 100 its advantages by essentially hiding the latency that is 

constructed in accordance with a preferred computing envi- incurred when crossing the frequency boundary 116. In this 

ronment for the invention. More particularly, the invention regard, the latency penalties are paid at non-critical times, 

comprises an apparatus 102 that is particularly configured to For example, when a fetch instruction or operation is 

be interposed between a system bus 108 and an I/O bus 110. jj executed, whereby a device seeks to fetch data from system 

As will be appreciated by persons skilled in the art, the memory 104 via the I/O bxis 110, several cycles of delay are 

system bus 108 is a grouping of conductors that provides a encountered in order to fetch the data across the frequency 

means for intercommimicating among a number of devices boundary 116. While this latency frequently is incurred for 

in a computing system, such as a memory 104, a CPU 106, each successive byte of data that is fetched, the device 102 

as well as other devices. In similar fashion, the I/O bus 110 20 effectively hides this latency by incurring it (from the 

is a grouping of conductors that provides a means for the prospective of the I/O bus 110) only in connection with the 

devices 104, 106 on the system bus 108 to intercommunicate first data byte fetched. Subsequent data bytes may be fetched 

with other, remote devices I/O. into the data storage area 120 and then retrieved by the I/O 

For example, in a computing environment such as a bus 110 with no subsequent latency periods encountered 

personal computer, a system is frequently provided having a 25 cither. Similarly, and as will be further described below, the 

motherboard that contains certain system components, such device 102 operates in a manner that allows the snoopy 

as a memory and central processing unit (CPU), on a coherency protocol to occur on the system side (between the 

motherboard. To accommodate a variety of differing "expan- system bus 108 and the device 102) without incurring 

sion" cards, a plurality of expansion slots are provided on latency delays during the critical periods, 

the motherboard as well. Such expansion cards may include 30 Reference is now made to FIG. 3, which is a block 

disk drive controllers, modems and other conmaunication diagram that more specifically illustrates the device 102. As 

cards, etc. While some cards or devices such as these may previously mentioned, the device 102 includes a data storage 

intercommunicate with the CPU and memory via die system area 120 and an address storage area 122 that, conceptually, 

bus 108, others often communicate via an I/O bus, such as straddle the frequency boundary 116 between the system 

a peripheral component interface (PCI) bus. 35 frequency domain and the I/O frequency domain. In acoor- 

As is known, the devices that intercommunicate via the dance with the inventive concepts, synchronization circuits 

system bus 108 generally do so in a synchronous fashion, 130 and 140 are provided that also, concepmally, straddle 

insofar as these devices operate from a common system the frequency boimdary 116. 

clock. This is represented graphically in the figure by dashed In this regard, reference is made to FIG. 4 which is a 

frequency boundary line 116, whereby devices on the sys- 40 schematic diagram illustrating that circuitry that may pref- 

tem side of line 116 are said to operate in a system frequency erably comprise the synchronization circuit 130. 

domain, which is the frequency of the system clock. Devices Specifically, the synchronization block 130 preferably 

that intercommunicate across the I/O bus also do so in a includes three flip-flops 132, 134, and 136, configured as 

synchronized fashion, but are synchronized to or with a illustrated. A signal that is passed from the I/O frequency 

different clock. Indeed, typically the I/O clock operates at a 45 domain (more generically denoted as the first frequency 

different frequency than the system clock. Accordingly, domain is clocked using a first clock (CLKl) into a D-type 

devices on this side of the dashed line are said to operate in flip-flop 132. The output of this flip-flop is then directed to 

the I/O frequency domain. In prior art systems, a cache or flip-flop 134, which is clocked by a second clock (CLK2), 

other buffering device interposed between a system bus and that is operating in the system frequency domain (or more 

an I/O bus was typically designed to operate in either the 50 generically denoted as the second frequency domain). A 

system frequency domain or the I/O frequency domain. As third flip-flop 136 is provided, whereby the output of flip- 

a result, certain latency penalties were paid when conducting flop 134 is directed to the input of flip-flop 136. The purpose 

operations or transactions that crossed the frequency bound- of flip-flop 136 is to overcome the potential effects of 

ary 116. metastability of flip-flop 134. Specifically, if a signal 

The preferred embodiment of the present invention oper- 55 clocked out of flip-flop 132 is clocked into flip-flop 134, 

ates in a system that overcomes these latency shortcomings without satisfying either the set up or hold requirements for 

by providing a device 102 that effectively straddles the flip-flop 134, then the output of flip-flop 134 may be 

frequency boundary. The present invention operates in a metastable (i.e., neither a logic high nor a logic low). As is 

device having a cache data storage area 120 as well as an known, this metastability will typically resolve itself within 

address (or tag) storage area 122 that straddles the frequency 60 the period of a clock cycle. Accordingly, coincident with the 

boundary 116. In this regard, it will be appreciated that the next clock cycle of CLK2, the output of flip-flop 134 will be 

frequency boundary 116 is not so much a physical boimdary settled at either a logic high or logic low, thereby providing 

as it is a conceptual boundary. When the cache data storage a valid output at the output of flip-flop 136, 

area 120 and tag/address storage area 122 are characterized Retumii^ to FIG. 3, similar synchronization circuits 140 

to straddle this boundary, it simply means that one set of data 65 and 142 are provided for signals passing the opposite 

and one set of addresses are provided within a device 102, direction from that of the synchronization circuit 130. As 

as opposed to duplicative sets of data and addresses. will be further described below, state machines 152 and 154 
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are provided to control the communication and synchroni- 
zation of signals across the frequency boundary. 

For purposes of illustration and simplification, only one 
set of signals has been illustrated in FIG. 3 for each of the 
frequency domains. However, it will be appreciated that, in 
the preferred embodiment, there will be duplicative circuitry 
and signals set for each line of data in the data storage area 
120. A device 102 constructed in accordance with the 
preferred embodiment, includes sixteen lines of cache data 
within the data storage area 120, each line containing 
sixty-four bytes of data. Accordingly, the validity/status 
circuitry partially illustrated herein will be replicated sixteen 
times. 

As should be appreciated by persons of ordinary skill in 
the art, data within the data storage area 120 should never 
appear valid from the I/O side of the device 102 and not Crom 
the system side of the device 102. As will be appreciated 
from the diagram of FIG. 3, this status is insured by the 
configuration of the Invalidate, I/O Valid, and Sys. Valid 
signal lines. Broadly, when driving signals across the fre- 
quency boundary, the predse length of time that a signal 
should be asserted is uncertain. Therefore, it is preferred to 
provide some sort of feedback with the signaling. This is 
effectively achieved through the use of the Invalidate, I/O 
Valid, and Sys. Valid signal lines. 

More specifically, state machine 154 generates both Sys. 
Valid and Invalidate signal lines, which initiate the indica- 
tion that data is either valid or invalid from the system 
frequency domain. Both of these signals are passed through 
synchronization circuits 140 and 142, respectively, and input 
to state machine 152. State machine 152 generates an I/O 
Valid signal, which is directed to AND gate 164, as well as 
being passed back to the system frequency domain via 
synchronization circuit 130. 

By way of illustration, if, through a cache coherency 
check, it is determined that data within the data storage area 
is no longer valid (e.g., new data values written into system 
memory), then state machine 154 asserts the Invalidate 
signal line. This signal passes through synchronization cir- 
cuit 142 and is input to state machine 152. In response, state 
machine 152 negates the I/O Valid signal line, immediately 
signaling in the I/O firequency domain that the data is not 
valid. The I/O Valid signal is also directed to state machine 
154, via synchronization circuit 130, Therefore, after a few 
clock cycle delay, the negated I/O Valid signal is input to 
state machine 154, which negates the Sys. Valid signal line 
in response. la addition, the Invalidate signal line may be 
de-asserted. The Sys. Valid signal is directed to AND gate 
168, to indicate in the system frequency domain that the data 
is no longer valid. The Sys. Valid signal is also directed to 
state machine 152 via synchronization circuit 140, Thus, 
through this signaling it is observed that the validity status 
of data within the data storage area 120 is first indicated as 
invalid on the I/O side, before it is indicated as invalid on the 
system side. 

Likewise, when data is retrieved from system memory 
into the data storage area 120, then state machine 154 may 
assert the Sys. Valid signal line. This signal is delivered to 
state machine 152, which asserts the I/O Valid signal line. 
Conversely, if data within the data storage area is to be 
marked as invalid from the I/O frequency domain side, then 
state machine 152 simply negates the I/O Valid signal (even 
without being first prompted by the Invalidate signal line). 
This immediately signals (via AND gate 164) in the I/O 
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frequency domain that data in the data storage area 120 is 
not valid. Then, several cycles later, data will be indicated as 
invalid (via AND gate 16S) in the system frequency domain. 
Thus, state machines 152 and 154 are provided, which 

5 generate and control the status of the Invalidate, Sys. Valid, 
and I/O Valid signal lines. For example the first state 
machine 154 may be configured to generate an Invalidate 
signal in response to coherency transactions on the system 
data bus. The second state machine 152 may be configured 
to generate an I/O Valid signal in response to a completed 
fetch operation, whereby data is fetched from system 
memory into the data storage area. The specific circuitry 
used to implement the state machines 152 and 154 is not 
deemed to form part of the present invention, and therefore 
is not illustrated herein. Indeed, this circuitry may be imple- 
mented in a variety of different ways depending upon the 
particular application of device 102 and the particular imple- 
mentation chosen for the device 102. Notwithstanding, 

2Q persons of ordinary skill in the art should appreciate a 
number of ways to implement the state machines 152 and 
154, and therefore a detailed discussion of the same is not 
necessary herein. 
In addition, comparison circuitry 162, 164, 166, and 168 

25 are provided to indicate whether particular data (as identified 
by an address) is presently within the data storage area 120. 
This circuitry may be implemented in a conventional 
manner, wherein comparison logic 162 is provided to deter- 
mine whether a given address is presently within the address 
storage area 122. If so, the output of the comparator 162 is 
effectively ''ANDed'* with the validity status as reflected by 
the I/O Valid signal line. That is, in order to generate a ''hit" 
for an I/O request, the address of the specified data must not 

35 only be present within the address storage area 122 (as 
determined by comparator 162), but that data must also be 
valid (as detennined by state machines 152 and 154). 
Similarly, a request for data from the system side of the 
device 102 will result io a ''hit" only if the address is present 
within the address storage area 122 (as determined by 
comparator 166) and the data status is valid (as determined 
by the state machines 152 and 154). 
As will be appreciated, data requested from the I/O data 

45 bus typically will be fetched from system memory 104 into 
the data storage area 120. As previously mentioned, in the 
preferred embodiment, the I/O data bus 110 is a PCI bus. As 
is known, there is a mechanism provided in connection with 
PCI communications, whereby if data requested from sys- 

50 tem memory is not available within the cache of device 102, 
then the requesting circuitry will re-request the data several 
cycles later. Accordingly, if data requested from the PCI bus 
is not presently within the cache of the device 102, a hit (as 
reflected by gate 164) will not be generated. The system of 

55 the present invention will then request the data from system 
memory 104 and retrieve it into the data storage area 120. 
Then, a subsequent read from the PCI bus for that data wiU, 
in fact, generate a hit and data can be immediately read from 
the data storage area 120 to the PCI bus, without incurring 

60 latency. Consistent with the invention, data may be pre- 
fetched from the system memory 104 such that the latency 
associated with a fetch or request for data from the PQ bus 
is encountered only in connection with the initial fetch, and 
data from subsequent fetches may be immediately retrieved 

65 from the data storage area 120. 

In the same way, as devices that are di^osed in commu- 
nication with the system bus 108 initiate requests for data 
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from system memory 104, or write data to locations within performance of a cache memory manager 202 of cache 

a system memory 104, the device 102 of the present inven- memory 126. In accordance with the broad aspect of the 

tion must ensure coherency of data between the system invention, the memory manager simply ensures that a 

memory 104 and the data storage area 120. In a manner that certain, predetermined amount of memory space remains 
is known, this coherency may be maintained through a 5 available at all times. To achieve this, the memory manager 
snoopy cache coherency process. It will be appreciated from 1 r I 

the foregoing discussion, that when a sy^em address is JjG^—ously monitors the memory space of the cache, 
presented to comparator 166 to determine whether data is space that is immediately available to receive 

presently located within the data storage area 120 (as new data (i.e., cache lines presently containing no data) falls 

specified by an address within the address storage area 122), below a certain predetermined amount, then the memory 

the validity circuitry, including state machines 152 and 154, manager operates to flush one or more lines of cache data, 
provide an immediate indication as to whether the data , - , ^ • . . . . ^ 

Within the dau storage area 120 is vaUd, without incurring ^ ^5^^^°^ embodying the present invention, it is deter- 

the latency delays otherwise associated with crossing the mined whether new data is delivered to the cache upon, for 

frequency boundary 116. It will be appreciated that this example, a data fetch (step 302). Once a request is detected, 

advantageously provides for fast miss detection (i.e., data the memory manager 202 then immediately reads the data 

not in stor^e area 120) of data. However, if data is detected, from system memory into tlw memory space of the cache 

then more information wiU need to be obtained and latency memory (step 304). In accordance with the invention, no 

wiU be incurred Since the present inveritionfe ^^^^ue latency delay need be incurred here, since the 

an I/O cache, however, vast majority of the time, ^ ^ ^02 may assume that memory space 
requested data will not be present m the cache, thereby ^ ^ ' . . r 

obtaining the benefits of the quick miss detection the vast immediately available for 

majority of the time. storage. 

Having described certain basic aspects of both cache Thereafter, the memory manager 202 controls the place- 
memory devices, the discussion will now be directed to the ment of data read from the system memory 104 into the 
prefened embodiment of the present invention. In this memory space 204 of the cache memory 126. This step 
regard, the present invention is directed to a system and implicitly may include updating the directory 206 with the 
method for managing the data within a cache memory, lag(s) for the data line(s) read into the cache memory 126, 
whereby data that is not expected to be used again in the near "'^T'^ manager 202 controls the communication 
future is immediately discarded from the cache. 30 ^""^ ^° ^^^^ ^^)- 

System and Operation of the Present Invention accordance with the preferred enibodiment of the 

^ . . . ^ present invention, the memory manager 202 ensures that a 

Having described the operatmg environment of the pre- predefined or predetermined amount of space is maintained 

fened embodunent of the present mvention, reference is ^ "available" for new cache data (step 308). As will be 
now made to FIG. 5, which is a block diagram iUustraUng 35 ^^^^ this "available" space may be programma- 
certain fundamental components of a cache memory 126. defined iising a flush control register 208. If at least the 

constructed in accordance with the present invention. predetermined amount of date is determined to be avaikble, 

PrincipaUy the cache memory 126 includes a meinory ^^n the system may return to step 302, and await the next 

manager 202.memoryjaoe2H and a directoiy/igdex 206. jata fetch. If, however, less than the predefined amount of 
The memory space 204 corresponds to the data Lnes dis- ^ ^^^^^ ^^^^y ^ ^^^j^^,^ ^^ ^^^^ 

«nissedmcomiectionwithFlG.l.Theinemorymanager202 jiscanled (or additional lines, if necessary to clear the 

IS preferably implemented in hardware, comprising inte- p^defined amount of space) (step 310). 
grated circuits specifically configured to carry out various 

functions. In one respect, a portion of the memory manager 45 I" accordance with the preferred embodiment of the 

202 may be viewed as comprising a plurality of functional present invention, the memory manager 202 utilizes a flush, 

units 210, each made up from dedicated hardware. For Control register 208, which provides a mechanism that 

example, and as will be flirther discussed below in connec- allows the amount of available space to be programmably 

tion with FIG. 6, these functions may include discarding varied. Specifically, and as previously described, memory 

cache data to make space available 214, reading data into the fetches from an I/O bus across the frequency boundary 116 

cache 216, writing data out to the PCI bus 218, and evalu- necessarily incur several latency cycles. This latency delay 

ating or checking the flush control register 220. The specific is minimized by placement of the device across 

implementation of the functions listed above may be made (conceptually) the frequency boundary 116, incurring only 
in different ways, consistent with the concepts and teachings 55 latency associated with synchronizing signal 

of the present invention. transfers across the frequency boundary 116. The present 

As illustrated, the memory manager may further include invention further minimizes any delays by insuring that 
additional elements, such as a mechanism 232 for determin- ^ always ^ace available within the cache memory to 

ingaquantityof available space within a data storage area, receive new data. Although there is some latency even 

a mechanism 234 for comparing the quantity of available associated with retrieving data into clean cache lines, the 

space with a predetermined value, and a mechanism 236 for invention reduces latency delays that may occur when the 
controllably flushing at least one line of data from the data memory is full, and data must first be discarded before 

storage area of the cache memory if the available space is ™y ^ fetched into the cache memory, 

less than the predetermined value. ^5 in accordance with the preferred embodiment of the 

In accordance with one aspect of present invention, and invention the flush control register has the form illustrated in 

referring to FIG. 6, a method is provided for improving the TABLE 1, below: 
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Specifically, the preferred flush control register is a sixty- 
four bit register. The sixteen most significant bits (bits 
63-48) are reserved (presently unused), and default to a 
value of (X)00 (Hex) upon reset initialization. The following 
sixteen bits (bits 47-32) comprise a line enable, and default 
to OxFFFF (Hex) upon reset initialization. The line enable 
function provides added fault tolerance and test scenarios. 
The following twenty bits (31-12) arc reserved and default 
to 0(XK)0 (Hex) upon reset initialization. Likewise bits 2-7 
are reserved and default to 0 upon initialization. Bits 0-1 and 
8-11 are defined as set forth in TABLE 2, immediately 
below. 



Finally, and in accordance with the invention, bits 0 and 
1 (denoted as FLUSH_LEVEL) are bits that define the 
amount of space (in terms of cache lines) that the memory 
manager 202 is to maintain as "available" at all times. In 
accordance with the preferred embodiment, the number of 
cache lines that are maintained as "available" is actually one 
more than the binary number specified by the two bit values 
of the flush control register 208. Therefore, the device 102 
of the preferred embodiment is programmably controllable 
to vary the number of available cache lines between one and 
four. Alternatively, the two bit positions may be configured 
to specify firom zero to three available cache lines. 



25 



TABLE 2 







Reset 




Rinction 


Bits 




De script ion 


WPF_JEMPTY 


11 


1 


This bit indicates that the wriie-posting FIFO 
is empty. This bit is set before DMA can be 
considered inactive. 


FLUSIL-CACHE 


10 


0 


Writing 1 to this bit sends a flush signal to all 
cache lines. Writing a 0 has no effect. Reads 
of this bit indicate the flush status of the 
cache. Al indicates that flushes are still 
pending 


AUTO_W_FLUSH 


9 


1 


Enables the cache to flush a line when the last 
byte of the line is writteo. 


AUTO__RD_FLUSH 


8 


1 


Enables the cache to flush a tine when the last 
word of the line is read. 


FLUSH_LEVEL 


0-1 


11 


Seta the number of cache tines the Memory 
Manager attenq)t8 to keep empty to 
FLUSHL-LEVEL + 1. 



Specifically, bit 11 (denoted as WPF_EMPTY) indicates 
whether a write-posting FIFO is empty. This bit must be 
observed as "Set" before direct memory accesses (DMAs) 
are considered inactive. Bit 10 (denoted as FLUSH_ ^5 
CACHE) indicates whether cache line flushes are pending. 
In this regard, bit 10 is set (writing a 1 to this bit position) 
to send a flush signal to all cache lines. Circuitry automati- 
cally clears bit 10, when the flush is complete. Therefore, bit 
10 may be monitored to determine when a flush is complete, 
Therefore, reading the bit position will indicate the status of 
the flush operation, whereby a one indicates that the flush 
operation is not completed. Bits 8 and 9 (denoted as AUTO_ 
RD_FLUSH and AUTO_W_FLUSH, respectively) enable 
the cache to flush a line when the last byte of the line is read 
or written, respectively. Thus, for certain types of data it may 65 
be desirable to immediately discard (or flush) the data as 
soon as it is read or written. 



Of course, consistent with the concepts and teachings of 
the present invention, additional bit positions within the 
flush control register may be defined to control the number 
of available lines. For example, in an alternative 
embodiment, the flush control register may define four bit 
positions that specify the available space within the cache 
memory. In such an embodiment, sixteen different levels of 
availability may be specified. In one such embodiment, the 
four bit positions may specify from one to sbcteen cache 
lines as ''available". In yet another embodiment, the four bit 
positions may specify from zero to fifteen cache tines as 
"available." 

Having described preferred embodiments of the present 
invention above, it will be appreciated by those skilled in the 
ait that the broad concepts of the present invention are 
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applicable to other systems as welL In this regard, the broad 
objective of the present invention is to enhance system 
performance by discarding at least one line of data when the 
capacity of the cache memory space exceeds a predeter- 
mined threshold. Stated another way, the cache memory of 5 
the present invention is designed to maintain a certain, 
predefined (but programmably variable) amount of space 
that remains constantly available to receive new data. As 
new data is read into this space, leaving less than the 
minimum amount of space, then at least one (or more) line 
of cache data is discarded, freeing up additional space. This 
leaves a certain amount of space within the cache readily 
available to receive new data, without having to first discard 
other data maintained within the cache memory. The method 15 
or algorithm used to discard cache data, in order to make 
space available, may vary consistent with the invention. In 
this regard, the invention may employ a least recently used 
(LRU) algorithm, or some other algorithm for freeing up 
space. 

One advantage of a device constructed in accordance with 
the invention is that it reduces latency delays associated with 
the flushing of data from cache memory, when new data is 
sought to be read into the cache, but no space is available. 
The tradeoff, however, to realize this advantage is that the 
invention effectively reduces the size of the cache. 

The cache of the prefened embodiment is a sixteen line 
cache, wherein each line includes sixty-four bytes of data. 

The foregoing description is not intended to be exhaustive 
or to limit the invention to the precise forms disclosed. 
Obvious modifications or variations are possible in light of 
the above teachings. In this regard, the embodiment or 
embodiments discussed were chosen and described to pro- 
vide the best illustration of the principles of the invention 
and its practical application to thereby enable one of ordi- 
nary sldll in the art to utilize the invention in various 
embodiments and with various modifications as are suited to 
the particular use contemplated. All such modifications and 
variations are within the scope of the invention as deter- 
mined by the appended claims when interpreted in accor- 
dance with the breadth to which they are fairly and legally 
entitled. 

What is claimed is: 

1. In a computing system having a cache memory inter- 
posed between a system memory and an I/O bus, a method 
for managing data within the cache memory comprising the 
steps of: 

(a) providing a single data storage area in the cache 
memory in communication with both a system data bus 
and an I/O data bus; 

(b) providing a single address storage area configured to 
store system memory addresses corresponding to data 
contemporaneously stored in the data storage area; 

(c) identifying a request for data from the system memory 55 
by the device, wherein the data requested is not pres- 
ently in the single data storage area of the cache 
memory; 

(d) reading the data from the system memory into the 
single data storage area of the cache memory; 59 

(e) evaluating the cache memory to determine whether at 
least a predetermined amount of space within the single 
data storage area is readily available; 

(f) flushing a portion of the data within the single data 
storage area, in response to step (c), if at least the 65 
predetermined amount of space is not readily available; 
and 
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(g) repeating steps (c) through (Q. 

2. The method as defined in claim 1, wherein the data 
storage area comprises sixteen lines, each having sixty four 
bytes of data. 

3. The method as defined in claim 1, wherein step (b) 
includes immediately reading data from memory into an 
available line of the data storage area. 

4. The method as defined in claim 1, wherein step (b) 
includes determining the amount of data to be read from the 
system memory into the cache memory and reading the 
appropriate number of cache lines of data into the cache 
memory. 

5. The method as defined in claim 1, wherein the prede- 
termined amount of space is programmably configurable. 

6. The method as defined in claim 5, wherein the prede- 
termined amount of space is programmably configurable so 
that from one to four lines of storage area remain readily 
available at all times. 

7. In a computing system having a cache memory inter- 
posed between a system memory and a PCI bus, wherein 
data read from the system memory to the PCI bus is read 
through the cache memory, a method for managing data 
within the cache memory comprising the steps of: 

(a) providing a single data storage area in the cache 
memory in communication with both a system data bus 
and an I/O data bus; 

(b) providing a single address storage area configured to 
store system memory addresses conesponding to data 
contemporaneously stored in the data storage area; 

(c) determining a quantity of available space within the 
data storage area; 

(d) comparing the quantity of available space with a 
predetermined value; and 

(e) flushing at least one line of data from the cache 
memory if the quantity of available space is less than 
the predetermined value. 

8. The method as defined in claim 7, wherein the step of 
comparing includes evaluating a flush control register that 
specifies the predetermined value. 

9. The method as defined in claim 8, wherein the value 
specified by the flush control register is programmably 
variable. 

10. A system for managing data in an asynchronous I/O 
cache memory comprising: 

a single address storage area configured to store system 
addresses corresponding to data contemporaneously 
stored in the data storage area; 
memory space comprising a plurality of data lines within 
the cache memory, the memory space comprising a data 
storage area in communication with both a system data 
bus and an I/O data bus, wherein the data storage area 
is configured to store a non-duplicative data set; 
a single address storage area configured to store system 
addresses corresponding to data contemporaneously 
stored in the data storage area; 
a memory manager within the cache memory configured 
to manage the data within the memory space, the 
memory manager further including: 
first means for determining a quantity of available 

space within a data storage area; 
second means for comparing the quantity of available 

space with a predetermined value; and 
third means for controllably flushing at least one line of 

data from the data storage area of the cache memory 

if the available space is less than the predetermined 

value. 
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11. The system as defined in claim 10, wherein the first 
means evaluates a directory containing addresses of data 
stored within the data storage area. 

12. The system as defined in claim 10, wherein the second 
means includes a flush control register. s 

13. The system as defined in claim 12, wherein the flush 
control register includes a plurality of bits that specify the 
predetermined value. 
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14. The system as defined in claim 13, wherein the 
plurality of bits are programmably variable. 

15. The system as defined in claim 13, wherein the third 
means flushes at least one line of data from the data storage 
area, to maintain at least the predetemiined value of cache 
lines available to receive new data at all times. 

* « « « * 
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It is certified that error appears in the above-identified patent and that said Letters Patent is 
hereby corrected as shown below: 



Title page. 

Item [57], ABSTRACT, 
Line 3, delete "input/" 

Line 4, delete "output (I/O)" and insert therefor - I/O - 

Line 10, delete "free spaced" and insert therefor ~ free space ~ 

Line 11, after "lines are" delete "flushed or" 
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